Examining the Impact of ACE Interference on Multi-Bit AVF Estimates

Authors

  • Fritz Previlon
  • Mark Wilkening
  • Vilas Sridharan
  • Sudhanva Gurumurthi
  • David R. Kaeli
Abstract

Until recently, soft error reliability research has focused on single-bit errors; as a consequence, methodologies for Architectural Vulnerability Factor (AVF) analysis are well defined and established for single-bit faults. However, studies have shown that multi-bit faults are becoming more prevalent with technology scaling [1]. If this trend continues, multi-bit faults will eventually account for a high percentage of all microprocessor faults. Research into modeling methodologies for multi-bit faults is scarce. Recently, we presented an Architecturally Correct Execution (ACE) analysis methodology to evaluate the AVF of spatial multi-bit faults (MB-AVF) [2]. We used our methodology to study multi-bit AVF by extending the ACE analysis infrastructure of a performance simulator. While this methodology precisely measures AVF for Detected Unrecoverable Errors (DUEs), it only approximates AVF for Silent Data Corruptions (SDCs). This is because the methodology determines a bit’s ACE state using a single-bit ACE analysis. However, a bit’s ACE state may change due to the presence of another bit flip, a condition termed ACE interference, and this effect is not captured by the MB-AVF modeling methodology. As a result, our SDC calculation method is accurate only if ACE interference is rare for multi-bit faults. In this paper, we present a fault injection study to determine the prevalence of ACE interference in typical benchmarks executing on a GPU. Our results show that ACE interference is a rare event in GPUs: we found that the ACE state of a bit rarely changes in the presence of other faults. These results support the conclusion that our multi-bit ACE analysis can accurately estimate the SDC AVF of a processor design.
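
To make the notion of ACE interference concrete, the following is a minimal toy sketch, not the authors' simulator or GPU fault injection framework. It assumes a hypothetical `program` that maps a small register word to an output, treats a bit as ACE if flipping it alone changes that output, and compares the prediction of a single-bit ACE analysis against an actual multi-bit injection on adjacent bit groups. All names here (`flip`, `is_ace`, `interference_rate`, `masking`, `parity`) are illustrative assumptions.

```python
def flip(word, bits):
    """Return word with every bit position in `bits` inverted."""
    for b in bits:
        word ^= 1 << b
    return word


def is_ace(program, word, bit):
    """Single-bit ACE test: does flipping this bit alone change the output?"""
    return program(flip(word, [bit])) != program(word)


def interference_rate(program, word, width=8, group=2):
    """Fraction of adjacent `group`-bit faults where the single-bit
    prediction disagrees with the observed multi-bit injection."""
    groups = [tuple(range(s, s + group)) for s in range(width - group + 1)]
    mismatches = 0
    for bits in groups:
        # Single-bit analysis predicts corruption if any bit in the group is ACE.
        predicted = any(is_ace(program, word, b) for b in bits)
        # Fault injection: flip all bits of the group together.
        observed = program(flip(word, bits)) != program(word)
        mismatches += predicted != observed
    return mismatches / len(groups)


# Hypothetical toy programs (assumptions for illustration only).
def masking(w):
    return w & 0xF0  # low four bits are never read, so they are un-ACE


def parity(w):
    return bin(w).count("1") % 2  # two simultaneous flips cancel out


print(interference_rate(masking, 0b10100101))
print(interference_rate(parity, 0b10100101))
```

In this toy setting the masking program shows no interference, because each bit's effect on the output is independent of the others, while the parity program shows interference on every adjacent pair, because two flips mask each other. The paper's finding corresponds to real GPU workloads behaving much more like the first case than the second.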

Similar Resources

Bit Impact Factor: Towards making fair vulnerability comparison

Reliability is becoming a major design concern in contemporary microprocessors since soft error rate is increasing due to technology scaling. Therefore, design time system vulnerability estimation is of paramount importance. Architectural Vulnerability Factor (AVF) is an early vulnerability estimation methodology. However, AVF considers that the value of a bit in a clock cycle is either require...

Architectural Vulnerability Factors for Address-Based Structures

Processor designers require estimates of the architectural vulnerability factor (AVF) of on-chip structures to make accurate soft error rate estimates. AVF is the fraction of faults from alpha particle and neutron strikes that result in user-visible errors. This paper shows how to use a performance model to calculate the AVF of address-based structures, using a data cache, a data translation bu...

Adaptive multi-stage parallel interference cancellation receiver for multi-rate DS-CDMA system

In this letter, adaptive multi-stage parallel interference cancellation (PIC) receiver is considered for multi-rate DS-CDMA system. In each stage of the adaptive multi-stage PIC receiver, multiple access interference (MAI) estimates are obtained by the sub-bit estimates from the previous stage and the adaptive weights for the sub-bit estimates. The adaptive weights are obtained by minimizing th...

Objective function based group-wise successive interference cancellation receiver for dual-rate DS-CDMA system

In this paper, objective function based group-wise successive interference cancellation (GSIC) receiver is proposed for a dual-rate DS-CDMA system. In the receiver, user signals are divided into 2 groups for their data rates. Initial bit estimates for the users in the group 1 are obtained by its matched filter (MF) bank. The initial bit estimates for the group 1 users are fed into the multi-use...

Single-Threaded Mode AVF Prediction During Redundant Execution

Transient faults can lead to serious errors in execution. Providing protection for the processor core against these faults requires redundant execution, which leads to a performance loss. However, not all bit flips have equal impact on the processor. The Architectural Vulnerability Factor (AVF) quantifies when a soft error is likely to alter the final output and when it has little impact due to...

Publication date: 2015